A two-phase strategy for detecting recombination in nucleotide sequences

Authors

  • Cheong Xin Chan
  • Robert G. Beiko
  • Mark A. Ragan
Abstract

Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. Delineating recombination events is important in the study of molecular evolution, as inference of such events provides a clearer picture of the phylogenetic relationships among different gene sequences or genomes. Nevertheless, detecting recombination events can be a daunting task, as the performance of different recombination-detecting approaches can vary, depending on evolutionary events that take place after recombination. We previously evaluated the effects of post-recombination events on the prediction accuracy of recombination-detecting approaches using simulated nucleotide sequence data. The main conclusion, supported by other studies, is that one should not depend on a single method when searching for recombination events. In this paper, we introduce a two-phase strategy, applying three statistical measures to detect the occurrence of recombination events, and a Bayesian phylogenetic approach to delineate breakpoints of such events in nucleotide sequences. We evaluate the performance of these approaches using simulated data, and demonstrate the applicability of this strategy to empirical data. The two-phase strategy proves to be time-efficient when applied to large datasets, and yields high-confidence results.
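The control flow of such a two-phase strategy can be sketched as below. This is only an illustrative outline, not the authors' implementation: the abstract does not name the three statistical measures or say how their results are combined, so the `detectors` and `locate_breakpoints` callables, the Bonferroni correction, and the `min_significant` rule are all assumptions made for this example. The point it shows is why the approach can save time: the costly breakpoint step runs only on alignments that the cheap screening step flags.

```python
from typing import Callable, Dict, List, Optional, Sequence

Alignment = Sequence[str]  # aligned nucleotide sequences of equal length


def two_phase_scan(
    alignment: Alignment,
    detectors: Dict[str, Callable[[Alignment], float]],  # hypothetical tests returning p-values
    locate_breakpoints: Callable[[Alignment], List[int]],  # hypothetical phase-2 routine
    alpha: float = 0.05,
    min_significant: int = 1,  # assumed decision rule, not taken from the paper
) -> Optional[List[int]]:
    """Return breakpoint positions if phase 1 flags the alignment, else None."""
    # Phase 1: cheap screening tests, each returning a p-value.
    threshold = alpha / len(detectors)  # Bonferroni correction (an assumption)
    hits = [name for name, test in detectors.items() if test(alignment) <= threshold]
    if len(hits) < min_significant:
        return None  # no evidence of recombination, so skip the costly phase
    # Phase 2: expensive breakpoint delineation, run only on flagged alignments.
    return locate_breakpoints(alignment)
```

In practice each entry in `detectors` would wrap a real recombination test, and `locate_breakpoints` would wrap a Bayesian phylogenetic analysis; here they are left as plug-in points.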

Related resources

The Major Sources of Genetic Differentiation Among Apricot Latent Virus (ApLV) Isolates

Background and Aims: Apricot latent virus (ApLV) is a species within the genus Foveavirus (family Betaflexiviridae, order Tymovirales). Phylogenetic analyses using nucleotide sequences of different ORFs divided most ApLV isolates into two clusters. However, there is little data about the sources of genetic differentiation among ApLV isolates. Materials and Methods: Partial coat protein (CP) sequences...

Perspective on Possible Recombination Event in Fusion Protein Gene of Newcastle Disease Viruses Isolated in Iran

Background and Aims: Newcastle disease (ND), caused by the virulent Newcastle disease virus (NDV), is one of the most important viral diseases in birds. In recent years, recombination throughout the genomes of NDVs isolated in China and Indonesia has been reported. This study investigated recombination events in the F gene of Iranian NDVs to generate useful data that ...

Evolutionary features of 8K (KDa) silencing suppressor protein of Potato mop-top virus

The cysteine-rich 8K protein of Potato mop-top virus (PMTV) suppresses host RNA silencing. In this study, an evolutionary analysis of 8K sequences of PMTV isolates was carried out on the basis of nucleotide and amino acid sequences. Twenty-one positively selected sites were identified in 8K coding regions. Recombination events were found in the 8K of PMTV isolates with a rate of 1.8. In total, 30 haplotyp...

Statistical methods of DNA sequence analysis: detection of intragenic recombination or gene conversion.

Simple but exact statistical tests for detecting a cluster of associated nucleotide changes in DNA are presented. The tests are based on the linear distribution of a set of s sites among a total of n sites, where the s sites may be the variable sites, sites of insertion/deletion, or categorized in some other way. These tests are especially useful for detecting gene conversion and intragenic rec...
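The idea of testing how s marked sites are distributed along n sites can be illustrated with a small Monte Carlo sketch. This is not the exact tests the abstract describes: it swaps in a simulation-based approximation, and the choice of statistic (the longest stretch without a marked site) and the names `max_gap` and `clustering_p_value` are assumptions made for this example.

```python
import random
from typing import Sequence


def max_gap(positions: Sequence[int], n: int) -> int:
    """Longest stretch of unmarked sites, counting both ends of the sequence."""
    edges = [-1] + sorted(positions) + [n]
    return max(b - a - 1 for a, b in zip(edges, edges[1:]))


def clustering_p_value(
    positions: Sequence[int], n: int, trials: int = 10000, seed: int = 0
) -> float:
    """Monte Carlo p-value: how often does a random placement of the same
    number of sites among n leave a gap at least as long as the observed one?"""
    rng = random.Random(seed)
    s = len(positions)
    observed = max_gap(positions, n)
    hits = sum(
        max_gap(rng.sample(range(n), s), n) >= observed for _ in range(trials)
    )
    return (hits + 1) / (trials + 1)  # add-one correction avoids a p-value of 0
```

A small p-value suggests the marked sites are more clustered than a uniform placement would explain; an evenly spaced set of sites gives a p-value near 1.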

An exact nonparametric method for inferring mosaic structure in sequence triplets.

Statistical tests for detecting mosaic structure or recombination among nucleotide sequences usually rely on identifying a pattern or a signal that would be unlikely to appear under clonal reproduction. Dozens of such tests have been described, but many are hampered by long running times, confounding of selection and recombination, and/or inability to isolate the mosaic-producing event. We intr...

Journal title:
  • South African Computer Journal

Volume 38

Publication date: 2007